tsp XhoI Start codon

G7T7I^\TACC£2£2£SG7TT7AAAT ATG TCT GTT GCC TTG TTA TGG GTT GTT TCT CCT TGT

Met Ser Val Ala Leu Leu Trp Val Val Ser Pro Cys Asp



Transit peptide of phytoene synthase 
GTC TCA AAT GGG ACA AGT TTC ATG GAA TCA GTC CGG GAG GGA AAC CGT
Val Ser Asn Gly Thr Ser Phe Met Glu Ser Val Arg Glu Gly Asn Arg



ClaI XhoI



BamHI 



Phytoene Synthase




TMV RNA (+7866)
KpnI



pBR322 



TMV RNA 



126K 



68 nt



SP6 promoter



Figure 1 



BamHI 




pBR322 



Figure 2 



tsp
GTTT 



XhoI Start codon

rACG CTCGAG TGCAGC ATG ACC CTT CTA AAG CCT TTT CCA TCT CCT TTA CTT ' 

Met Glu Thr Leu Leu Lys Pro Phe Pro Ser Pro Leu Leu



Transit peptide of capsanthin-capsorubin synthase 
TCC ATT CCT ACT CCT AAC ATG TAT AGT TTC AAA CAC AAC TCC ACT TTT
Ser Ile Pro Thr Pro Asn Met Tyr Ser Phe Lys His Asn Ser Thr Phe



ClaI XhoI Capsanthin-capsorubin Synthase



BamHI 




TMV RNA (+8095) 
KpnI



pBR322 



TMV RNA 



126K 



68 nt



SP6 promoter 



Figure 3 



NaeI(130)




KS+/TVCVK#23 

4516 bp



SacII(661)
SacI(657)
BstXI(665)
NotI(670)
XbaI(677)
SpeI(683)
BamHI(689)
SmaI(697)
PstI(705)
EcoRI(707)
EcoRV blunt(715)



PvuII(2529)
HindIII(227;
SalI(2286)
XhoI(2292)
KpnI(2311)



EcoRV blunt(226" 



Figure 4 



tsp XhoI Start codon

r.~-~r VAATACC --TCGAC CC ATG GCT TCC TCA CTT CTT TCC TCT CCA CCA GTT CCC ACC CCC ~ 

Met Ala Ser Ser Val Leu Ser Ser Ala Ala Val Ala Thr Arg



RUBISCO SSU Chloroplast Transit Peptide (N. tabacum)

ACC AAT GTT CCT CAA GCT AAC ATG GTT GCA CCT TTC ACT GCC CTT 
Ser Asn Val Ala Gln Ala Asn Met Val Ala Pro Phe Thr Gly Leu



"ciaTT 



BamHI



Phytoene Synthase
XhoI




TMV RNA (+6734) 
KpnI



pBR322 



TMV RNA (+1) 



126K 



68 nt 



SP6 promoter 



Figure 5 




Figure 6 




Nar I 10596 
Kas I 10596 
Ehe I 10596 




ScM 6665 

Pac I 5757

Bsp120 I 5455 
Apa I 5455 



Figure 7 




Nar I 9499

Kas I 9499 256 Sma I

Ehe I 9499 256 Xma I

Bbe I 9499 303 Bsm I




Pml I 5320
Xho I 5769

PaeR7 I 5759
Eag I 5763
Not I 5762

Nco I 5459
Bsp120 I 5455
Apa I 5455



Figure 8 



XhoI Start codon

GAG G ATG ATC AGA TTC TTA GTC CTC TCT TTG CTA ATT CTC ACC CTG TT 
Met Ile Arg Phe Leu Val Leu Ser Leu Leu Ile Leu Thr Leu Phe



Signal peptide -1 +1 Mature α-trichosanthin
CTA ACA ACT CCT GCT GTG GAG GGC | GAT GTT AGC TTC CGT TTA TCA
Leu Thr Thr Pro Ala Val Glu Gly | Asp Val Ser Phe Arg Leu Ser



BamHI 



α-trichosanthin

AvrII




KpnI



pBR322 



TMV RNA (+1)



126K



68 nt



SP6 promoter 
220 Dra III
223 BsaA I 

326 Nae I 



Xmn I 3515 



Sca I

Fsp I 3140

Bsa I 2987
Ahd I 2915




982 Bst1107 I
982 Sna I
982 Xca I
1108 Mlu I
1154 BsrG I
1201 Nde I
1208 BspE I
1251 Msc I
1309 Hind



AlwN I 2438 



1437 Sph I
1439 Nsi I
1522 Afl II
1584 Pst I
1629 Acc65 I
1629 Asp718
1629 Kpn I



Figure 10 



10339 
10339 
10339 
10389 



256 Sma I 
256 Xma I 
308 Bsm I 

LL 



981 SexA I 
1130 EcoN I



Ahd I



55-d337 t 7902 - 
K. 2 n f 7355 



BstE II 7114




1675 Stu I 



Xcm I



Pac I 5757

Bsp120 I 5455
Apa I 5455
SacI(6978)




SmaI(5090)



Figure 12 




Nucleotide sequence of 740 AT #2441 



ct tca ctt tcg ccg

ATG GCT CTA CCT AAC CAG CAA ACC GTG GAT TAC CCT AGC TTC AAG CTC GTT ATC GTT GGC
MALPNQQTVDYPSFKLVIVG

GAT GGA GGC ACA GGG AAG ACC ACA TTT GTA AAG AGA CAT CTT ACT GGA GAG TTT GAG AAG 
DGGTGKTTFVKRHLTGEFEK 

AAG TAT GAA CCC ACT ATT GGT GTT GAG GTT CAT CCT CTT GAT TTC TTC ACT AAC TGT GGC 
KYEPTIGVEVHPLDFFTNCG 

AAG ATC CGT TTC TAC TGT TGG GAT ACT GCT GGC CAA GAG AAA TTT GGT GGT CTT AGG GAT 
KIRFYCWDTAGQEKFGGLRD 

GGT TAC TAC ATC CAT GGA CAA TGT GCT ATC ATC ATG TTT GAT GTC ACA GCA CGA CTG ACA 
GYYIHGQCAIIMFDVTARLT

TAC AAG AAT GTT CCA ACA TGG CAC CGT GAT CTT TGC AGG GTT TGT GAA AAC ATC CCA ATT 
YKNVPTWHRDLCRVCENIPI

GTT CTT TGT GGG AAT AAA GTT GAT GTG AAG AAC AGG CAA GTC AAG GCC AAG CAG GTA ACA 
VLCGNKVDVKNRQVKAKQVT 

TTC CAC AGG AAG AAG AAC CTC CAG TAT TAC GAG ATA TCT GCC AAG AGC AAC TAC AAC TTC 
FHRKKNLQYYEISAKSNYNF

GAG AAG CCA TTC TTG TAC CTT GCT AGA AAG CTC GCC GGG GAC GCT AAT CTT CAC TTT GTG 
EKPFLYLARKLAGDANLHFV 

GAA TCA CCT GCC CTT GCT CCC CCG GAA GTT CAA ATC GAC TTG GCT GCT CAG CAG CAG CAT 
ESPALAPPEVQIDLAAQQQH 

GAG GCG GAG CTT GCA GCA GCA GCA AGT CAG CCA CTT CCT GAT GAC GAT GAT GAC ACC TTC
EAELAAAASQPLPDDDDDTF 

GAG TAG AGA AAG AGA GAT GTG ATC TGT CAC TGA TTA CCC GTT AGG GCT TGT CTG AAC TTT 
E 

TTT TTG TTC ATG GTG CTA TTT TTA TGT GTC CGT ACT TTG AAA TGA ATC GAT GAC ATT AGT 



AAT TTT CAT TTT TAA GTT TTT AAC TGT CGC TAT GAA AGT GAA AAC 



Figure 13 
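
The single-letter line under each codon row of Figure 13 is the conceptual translation of that row. As a minimal sketch (not part of the original drawing), the Python below builds the standard genetic code table and translates the first codon row. The names `translate` and `first_row` are illustrative only, and the thirteenth codon is read here as AGC (Ser), which is an assumption wherever the scan is unclear.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between codons
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First codon row of Figure 13 (13th codon assumed to be AGC).
first_row = (
    "ATG GCT CTA CCT AAC CAG CAA ACC GTG GAT "
    "TAC CCT AGC TTC AAG CTC GTT ATC GTT GGC"
)
print(translate(first_row))
```

Under that assumption the printed string is MALPNQQTVDYPSFKLVIVG, which agrees with the first translation line of the figure.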



Nucleotide sequence alignment of 740 AT #2441 to AF017991
A. thaliana salt stress inducible small GTP binding protein Ran1

Upper sequence in each block: 740 AT #2441; lower sequence: AF017991

1 CTTCACTTTCGCCGATGGCTCTACCTAACCAGCAAACCGTGGATTACCCTAGCTTCAAGC 60

MIIIIMIMMIIIIIIIIIIIMIMMIIIIIIilMMIMIIMIIMIIIMI 

67 CTTCACTTTCGCCGATGGCTCTACCTAACCAGGAAACCGT^^ 125 

61 TCGTTATCGTTGGCGATGGAGGCACAGGGAAGACCACATTTGTAAAGAGACATCTTACTG 120

MIIIMIIIIIIIIMIIIMIIIIIIIIIIIIMIIMIIIIMIIIIIIMIIIIM 

127 TCGTTATCGTTGGCGATGGAGGCACAGGGAAGACCACATTTGTAAAGAGACATCTTACTG 186

121 GAGAGTTTGAGAAGAAGTATGAACCCACTATTGGTGTTGAGGTTCATCCTCTTGATTTCT 180

IIIIIMIIMMIIIIINMMIIIIIIIMIIIIIIIIIIMIIIIII 

187 GAGAGTTTGAGAAGAAGTATGAACCCACTATTGGTGTTGAGGTTCATCCTCCTGATTTCT 246
181 TCACTAACTGTGGCAAGATCCGTTTCTACTGTTGGGATACTGCTGGCCAAGAGAAATTTG 240

IIMIIMIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIIIII 

247 TCACTAACTGTGGCAAGATCCGTTTCTACTGTTGGGATACTGCTGGCCAAGAGAAATTTG 306



241 GTGGTCTTAGGGATGGTTACTACATCCATGGACAATGTGCTATCATCATGTTTGATGTCA 300

IIIIIIMIIMIIIIIIMMIIIIIIIIIIIIIMIIMIIIIIIIIIIIIIIIIIII 

307 GTGGTCTTAGGGATGGTTACTACATCCATGGACAATGTGCTATCATCATGTTTGATGTCA 366
301 CAGCACGACTGACATACAAGAATGTTCCAACATGGCACCGTGATCTTTGCAGGGTTTGTG 360

MllllltlllllllMI IIMIIIIIIIIIIIIIIIIIIINIIIIIIIIIIIIIIII 

367 CAGCACGACTGACATACAGGAATGTTCCAACATGGCACCGTGATCTTTGCAGGGTTTGTG 426
361 AAAACATCCCAATTGTTCTTTGTGGGAATAAAGTTGATGTGAAGAACAGGCAAGTCAAGG 420

IIIIIIIIIMIIIllllllllllllllIlllillllllllllllllllllilllllMI 

427 AAAACATCCCAATTGTTCTTTGTGGGAA^ 486
421 CCAAGCAGGTAACATTCCACAGGAAGAAGAACCTCCAGTATTACGAGATATCTGCCAAGA 480

IIIIIIIIIIIIIMIIIIIIIIIII II I IlillllllMIIIIIIIIIIIIIIIM 

487 CCAAGCAGGTAACATTCCACAGGAAGGAGGAACTCCAGTATTACGAGATATCTGCCAAGA 546
481 GCAACTACAACTTCGAGAAGCCATTCTTGTACCTTGCTAGAAAGCTCGCCGGGGACGCTA 540

illlllllllilllMIIIIIIIIMIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIM 

547 GCAACTACAACTTCGAGAAGCCATTCTTGTACCTTGCTAGAAAGCTCGCCGGGGACGCTA 606
541 ATCTTCACTTTGTGGAATCACCTGCCCTTGCTCCCCCGGAAGTTCAAATCGACTTGGCTG 600

IMIIIMMMIIIIIIIIIMIIIIIIMIIIMIIIIIIIMIIIIIIIIMIIIII 

607 ATCTTCACTTTGTGGAATCACCTGCCCTT^ 666
601 CTCAGCAGCAGCATGAGGCGGAGCTTGCAGCAGCAGCAAGTCAGCCACTTCCTGATGACG 660

IMIIIMIIIMIMIIIIIMIIMIIIMIIIIIIIIIIMIIMIIIIIIIIIIM 

667 CTCAGCAGCAGCATGAGGCGGAGCTTGCAGCAGCAGCAAGTCAGCCACTTCCTGATGACG 726

661 ATGATGACACCTTCGAGTAGAGAAAGAGAGATGTGATCTGTCACTGATTACCCGTTAGGG 720

IIIMIIIIIIM MIlllillllllllllllllillllllllllllllllllilMIII 

727 ATGATGACACCTTCGAGTAGAGAAAGAGAGATGTGATCTGTCACTGATTACCCGTTAGGG 786

721 CTTGTCTGAACTTTTTTT 738 

IIIMIIIIIMIIIIM 

787 CTTGTCTGAACTTTTTTT 804 
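
The similarity shown by an alignment of this kind is commonly summarised as percent identity over the aligned columns. The following is a minimal sketch, assuming both rows have already been aligned to equal length with `-` marking gaps; `percent_identity` is a hypothetical helper for illustration, not a tool named in the source.

```python
def percent_identity(row_a: str, row_b: str) -> float:
    """Percent of aligned columns holding the same base in both rows.

    Columns containing a gap ('-') are counted as mismatches.
    """
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have equal length")
    if not row_a:
        raise ValueError("empty alignment")
    matches = sum(
        1
        for a, b in zip(row_a.upper(), row_b.upper())
        if a == b and a != "-"
    )
    return 100.0 * matches / len(row_a)


# Final block of the alignment above (positions 721-738 vs 787-804).
query_block = "CTTGTCTGAACTTTTTTT"
subject_block = "CTTGTCTGAACTTTTTTT"
print(percent_identity(query_block, subject_block))
```

For the final block both rows are identical, so the function returns 100.0; a single mismatch in a seven-column block, for example, would give 6/7, or about 85.7.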
Nucleotide sequence alignment of 740 AT #2441 to L16787
N. tabacum small ras-like GTP-binding protein 



Upper sequence in each block: 740 AT #2441 (blocks start at positions 15, 75, 135, 195, 255, 315, 375, 435, 495, 555, 615, 675); lower sequence: L16787 (blocks start at positions 37, 97, 157, 217, 277, 337, 397, 457, 517, 577, 637, 697).



ATGGCTCTACCTAA-CCAGCAAACCGTGGATTACCCTAGCTTCAAGCTCGTTATCGTTGGC 74

illlimill ilili Mill li Mill II IMMMMM M MM! M! 

ATGGCTCTACCAAACCAACAAACTGTAGATTATCCAAGCTTCAAGCTTGTAATCGTGGGC 96



GATGGAGGCACAGGGAAGACCACATTTGTAAAGAGACATCTTACTGGAGAGTTTGAGAAG 134

i i [ I 1 1 1 I M Mill I I II I I ( 1 1 Mill IMMMMM M 1 i 1 ! 1 1 II 1 

GATGGAGGAACTGGGAAAACAACTTTTGTCAA 156

AAGTATGAACCCACTATTGGTGTTGAGGTTCATCCTCTTGATTTCTTCACTAACTGTGGC 194

II I [ i I i i I M M I I M I I I 1 I IMMMMM ! M MIMIII ! I Mill 

AAATATGAACCCACTATTGGTGTGGAGGTTCATCCATTAGACTTCTTCACAAATTGTGGG 216
AAGATCCGTTTCTACTGTTGGGATACTGCTGGCCAAGAGAAATTTGGTGGTCTTAGGGAT 254

1 1 II 1 1 1 1 M II IMMMMIMM MINIM IMM I 1 1 I M Mill 

AAAATTCGCTTTTATTGCTGGGATACTGCTGGACAAGAGAAGTTTGGAGGTCTTCGGGAT 276
GGTTACTACATCCATGGACAATGTGCTATCATCATGTTTGATGTCACAGCACGACTGACA 314

MIMIIIMI IMM Mill II II IMMMMIMM Mill II IMM 

GGTT AC T ACATTCATGGGCAATGC GC AATTATGATGTTTGAT C C GTC TGAC C 3 36 

TACAAGAATGTTCCAACATGGCACCGTGATCTTTGCAGGGTTTGTGAAAACATCCCAATT 374

MIMIIIMIMI II IMM M Mill 1 1 II II M I II II 1 1 1 1 M I M I Ml 

TACAAGAATGTTCCTACGTGGCATCGAGATC 396

GTTCTTTGTGGGAATAAAGTTGATGTGAAGAACAGGCAAGTCAAGGCCAAGCAGGTAACA 434

IIIMIIIIM II MIMIIIMI IIIIIMIIII II Mill Mill II II 

GTTCTTTGTGGAAACAAAGTTGATGTCAAGAACAGGCA^ 456

TTCCACAGGAAGAAGAACCTCCAGTATTACGAGATATCTGCCAAGAGCAACTACAACTTC 494

IMIIIMMIMI II I II M II Mill II M IMM IIMMIMM 

TTCCACAGGAAGAAAAATTTGCAATACTATGAGATCTCAGCAAAGAGTAACTACAACTTT 516
GAGAAGCCATTCTTGTACCTTGCTAGAAAGCTCGCCGGGGACGCTAATCTTCACTTTGTG 554

MIIIMI II MMIIIMI IMIMM II Mill IMMIIIIIIMIMM 

GAGAAGCCTTTTCTGTACCTTGCCAGAAAGCTTGCTGGGGATGCTAATCTTCACTTTGTG 576
GAATCACCTGCCCTTGCTCCCCCGGAAGTTCAAATCGACTTGGCTGCTCAGCAGCAGCAT 614

MIMIIIMI IMMMMM Mill Mill II II Mill Mill I MM 

GAATCACCTGCACTTGCTCCCCCTGAAGTACAAAT^ 636

GAGGCGGAGCTTGCAGCAGCAGCAAGTCAGCCACTTCCTGATGACGATGATGACACCTTC 674

II Milll Ml II M IMMII I M 1 1 I! ! I II M I ! M 

GAACAAGAGCTTTTGCAAGCCGCTGCGCACGCACTTCCAGATGACGATGATGAAGCTTTT 696
GAGTAGA 681 

II MM 

GAATAGA 703 
Amino acid sequence comparison of 740 AT #2441 to tobacco RAN-B1 GTP binding protein

Nt RAN-B1 MALPNQQTVDYPSFKLVIVGDGGTGKTTFVKRHLTGE

llll!MIIIIIMIIIIIM!IIMIIM!lllll I I I I II I I I I! I I I !! I I I 

740 AT #2441 MALPNQQTVDYPSFKLVIVGDGGTGKTTFVKRHLTGEFEKKYEPTIGVEVHPLDFFTNCG



Nt RAN-B1 KIRFYCWDTAGQEKFGGLRDGYYIKGQCAIIMFDVTSTTDIQECSNMAP*SLQGL*K>'3Q

I I I I i I I I I I i I I I I I I I II I I I I I I I I I | | | | | | | + + | 

740 AT #2441 KIRFYCWDTAGQEKFGGLRDGYYIHGQCAIIMFDVTARLTYKNVPTWHR-DLCRVCENIP

Nt RAN-B1 L F F VG I KLM * KNRQ VKAQ

+ II* I I I I I I I* 
740 AT #2441 IVLCGNKVDVKNRQVKAK
Nucleotide sequence of 740 AT #1191



GCT ACT ATG GTT GCC TCT CCG GCT CAG GCC ACT ATG GTC GCT CCT TTC AAC GGA CTT AAG
ATMVASPAQATMVAPFNGLK

TCC TCC GCT GCC TTC CCA GCC ACC CGC AAG GCT AAC AAC GAC ATT ACT TCC ATC ACA AGC
SSAAFPATRKANNDITSITS

AAC GGC GGA AGA GTT AAC TGC | ATG CAG GTG TGG CCT CCG ATT GGA AAG AAG AAG TTT GAG
NGGRVNC | MQVWPPIGKKKFE

ACT CTC TCT TAC CTT CCT GAC CTT ACC GAT TCC GAA TTG GCT AAG GAA GTT GAC TAC CTT
TLSYLPDLTDSELAKEVDYL

ATC CGC AAC AAG TGG ATT CCT TGT GTT GAA TTC GAA GT
IRNKWIPCVEFEV
Nucleotide sequence alignment of 740 AT #855 to Arabidopsis thaliana
HAT7 homeobox protein ORF (U09340) 



HAT7 GAA ACG ACG TCG GCT AGT TAT TGG GCA TGG CCT GAC CAG CAG CAA
     ETTSASYWAWPDQQQ

HAT7 CAA CAT CAC AAT CAT CAT CAG TTC AAT TGA tcatattgtctaagaaca
     QHHNHHQFN*

HAT7 acatcatactcatcttgatatcattattatcatcaaaagaaaattccgtagattttttta
HAT7 ataagtattttcaaattatttggcacgtttaaaattaattaaattgggttattatgttt



740 AT #855 acttgatctgtttcatactaaaaccaaaactcatgtttgttcactccaaacacaaacaca
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
HAT7        acttgatctgtttcatactaaaaccaaaactcatgtttgttcactccaaacacaaacaca

740 AT #855 gcagtaatcaaaaatcgtcttataacaaaaaggaaatgcaacaaaacagaagaaacaact
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
HAT7        gcagtaatcaaaaatcgtcttataacaaaaaggaaatgcaacaaaacagaagaaacaact

740 AT #855 aagtagtaggcaagattcttcttcactcgtcttcttggctacggagcc
            ||||||||||||||||||||||||||||||||||||||||||||||||
HAT7        aagtagtaggcaagattcttcttcactcgtcttcttggctacggagcc



Nucleotide sequence alignment of 740 AT #909 to H. sapiens S56985 ribosomal protein L19
[human, breast cancer cell line, MCF-7] 

740 AT #909 1 GAAGCGGCTCGCCGCATCAGTGATGAAGTGCGGGAAGGGCAAAGTTTGGCTCGATCCCAA 60

I ! I I I I I I I I I I I II I 1 I I I I I I II II III I II I I II I 

S56985 32 GAAGAGGCTCGCCTCTAGTGTCCTCCGCTGTGGCAAGAAGAAGGTCTGGTTAGACCCCAA 91

740 AT #909 61 CGAAAGCTCCGACATCTCCATGGCCAATTCCCGCCAAAACATCAGGAAGCTTGTGAAGGA 120

I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I M I I I 
S56985 92 TGAGACCAATGAAATCGCCAATGCCAACTCCCGTCAGCAGATCCGGAAGCTCATCAAAGA 151 

740 AT #909 121 TGGTTTCATCATCAGGAAGCCAACCAAGATTCACTCTCGTTCCAGAGCTCGCAAAATGAA 180

III I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

S56985 152 TGGGCTGATCATCCGCAAGCCTGTGACGGTCCATTCCCGGGCTCGATGCCGGAAAAACAC 211 

740 AT #909 181 GATTGCCAAGATGAAGGGTCGTCACTCTGGATACGGTAAGAGGAAGGGTACCCGTGAAGC 240

I I I I Mill! I I I I II I I I I I I I I I I I I I I I I III 

S56985 212 CTTGGCCCGCCGGAAGGGCAGGCACATGGGCATAGGTAAGCGGAAGGGTACAGCCAATGC 271

740 AT #909 241 TAGGTTGCCAACAAAGGTACTGTGGATGCGTAGGATGCGTGTTCTTAGGCGTCTGTTGAA 300 

I I I I I I I I I I I I M I I I I I I I II I III I II III I I 

S56985 272 CCGAATGCCAGAGAAGGTCACATGGATGAGGAGAATGAGGATTTTGCGCCGGCTGCTCAG 331

740 AT #909 301 GAAATACAGAGAGACGAAGAAGATTGACAAGCACATGTACCATGACATGTACATGCGTGT 360 

I MM I II I II I I I I I I I I I I II I II I II I lllll II II 

S56985 332 AAGATACCGTGAATCTAAGAAGATCGATCGCCACATGTATCACAGCCTGTACCTGAAGGT 391

740 AT #909 361 TAAGGGTAATGTGTTCAAGAACAAGCGTGTCTTGATGGAGAGTATCCACAAGTCAAAGGC 420

I I I I I II II II I I I I I I II li II I I I I I I I I I I I I I II I I Mill 

S56985 392 GAAGGGGAATGTGTTCAAAAACAAGCGGATTCTCATGGAACACATCCACAAGCTGAAGGC 451

740 AT #909 421 TTAGAAGCTAGGGGAGAA 438 

I II I I MM 

S56985 452 AGACAAGGCCCGCAAGAA 469
Amino acid sequence alignment of 740 AT #909 to human P14118 60S ribosomal protein L19



740 AT #909    1 KRLAASVMKCGKGKVWLDPNESSDISMANSRQNIRKLVKDGFIIRKPTKIHSRSRARKMK 60
                 KRLA+SV++CGK KVWLDPNE+++I+ ANSRQ IRKL+KDG IIRKP  +HSR+R RK
Human P14118   8 KRLASSVLRCGKKKVWLDPNETNEIANANSRQQIRKLIKDGLIIRKPVTVHSRARCRKNT 67

740 AT #909   61 IAKMKGRHSGYGKRKGTREARLPTKVLWMRRMRVLRRLLKKYRETKKIDKHMYHDMYMRV 120
                 +A+ KGRH G GKRKGT  AR+P KV WMRRMR+LRRLL++YRE+KKID+HMYH +Y++V
Human P14118  68 LARRKGRHMGIGKRKGTANARMPEKVTWMRRMRILRRLLRRYRESKKIDRHMYHSLYLKV 127

740 AT #909  121 KGNVFKNKRVLMESIHKSKA*KLGEK 146
                 KGNVFKNKR+LME IHK KA K  +K
Human P14118 128 KGNVFKNKRILMEHIHKLKADKARKK 153